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Abstract 

We discuss nonparametric estimation of the distribution function G{x) of the autoregressive coefficient 
a G (—1,1) from a panel of N random-coefficient AR(1) data, each of length n, by the empirical distribution 
function of lag 1 sample autocorrelations of individual AR(1) processes. Consistency and asymptotic 
normality of the empirical distribution function and a class of kernel density estimators is established 
under some regularity conditions on G{x) as N and n increase to infinity. The Kohnogorov-Smirnov 
goodness-of-fit test for simple and composite hypotheses of Beta distributed a is discussed. A simulation 
study for goodness-of-ht testing compares the finite-sample performance of our nonparametric estimator 
to the performance of its parametric analogue discussed in [T]. 
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1 Introduction 

Panel data can describe a large population of heterogeneous units/agents which evolve over time, e.g., house¬ 
holds, firms, industries, countries, stock market indices. In this paper we consider a panel where each 
individual unit evolves over time according to order-one random coefficient autoregressive model (RGAR(l)). 
It is well known that aggregation of specific RGAR(l) models can explain long memory phenomenon, which 
is often empirically observed in economic time series (see [9] for instance). More precisely, consider a panel 
{Xi(t), t = 1,..., n, z = 1,..., N}, where each Xi = {Xi{t),t G Z} is an RGAR(l) process with (0, cr^) noise 
and random coefficient a* G (—1,1), whose autocovariance 

EX^{0)X^{t) = / --2dG(x) (1.1) 

J-i 
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is determined by the distribution function G{x) = Pr(a < x) of the autoregressive coefficient. Granger [9j 
showed, for a specific Beta-type distribution G{x), that the contemporaneous aggregation of independent 
processes {Xj(f)}, i = 1,..., A^, results in a stationary Gaussian long memory process {dl(t)}, i.e., 

N 

N-^G^Xi{t) ^fdd X{t) asN^oo, (1.2) 

i=l 

where the autocovariance EX{0)X{t) = EXi(0)Xi(t) decays slowly as t —)■ oo so that Ylt&z 

A natural statistical problem is recovering the distribution G{x) (the frequency of a across the population of 
individual AR(1) ‘microagents’) from the aggregated sample {X{t), t = 1,... ,n}. This problem was treated 
in [aEmg. Some related results were obtained in muni HI]. Albeit nonparametric, the estimators in mm 
involve an expansion of the density g = G'm an orthogonal polynomial basis and are sensitive to the choice 
of the tuning parameter (the number of polynomials), being limited in practice to very smooth densities g. 
The last difficulty in estimation of G from aggregated data is not surprising due to the fact that aggregation 
per se inflicts a considerable loss of information about the evolution of individual ‘micro-agents’. 

Clearly, if the available data comprises evolutions {Xi(t), t = 1,..., n}, z = 1,..., iV, of all N individual 
‘micro-agents’ (the panel data), we may expect a much more accurate estimate of G. Robinson [15] constructed 
an estimator for the moments of G using sample autocovariances of Xi and derived its asymptotic properties 
as N ^ oo, whereas the length n of each sample remains fixed. Beran et al. [I] discussed estimation of two- 
parameter Beta densities g from panel AR(1) data using maximum likelihood estimators with unobservable 
Oj replaced by sample lag 1 autocorrelation coefficient of Xi(l),... ,Xi{n) (see Section]^, and derived the 
asymptotic normality together with some other properties of the estimators as N and n tend to infinity. 

The present paper studies nonparametric estimation of G from panel random-coefficient AR(1) data using 
the empirical distribution function: 


1 


N 


GN,n{^) ■= ^3;), X G 

i=l 


(1.3) 


where ai^n is the lag 1 sample autocorrelation coefficient of Aj, z = 1,..., iV (see (3.3) below). We also discuss 
kernel estimation of the density g{x) = G'{x) based on smoothed version of (1.3). We assume that individual 
AR(1) processes Xi are driven by identically distributed shocks containing both common and idiosyncratic 
(independent) components. Consistency and asymptotic normality as A, n —>■ oo of the above estimators 
are derived under some regularity conditions on G{x). Our results can be applied to test goodness-of-fit of 
the distribution G{x) to a given hypothesized distribution (e.g., a Beta distribution) using the Kolmogorov- 
Smirnov statistic, and to construct confidence intervals for G{x) or g{x). 

The paper is organized as follows. Section 2 obtains the rate of convergence of the sample autocorrelation 
coefficient to a*, in probability, the result of independent interest. Section 3 discusses the weak convergence 
of the empirical process in (1.3) to a generalized Brownian bridge. The Kolmogorov-Smirnov goodness-of-fit 
test for simple and composite hypotheses of Beta distributed a is discussed in Section 4. In Section 5 we study 
kernel density estimators of g{x). We show that these estimates are asymptotically normally distributed and 
their mean integrated square error tends to zero. A simulation study of Section 6 compares the empirical 


performance of (1.3) and the parametric estimator of [T| to the goodness-of-fit testing for G{x) under null 
Beta distribution. The proofs of auxiliary statements can be found in the Appendix. 

In what follows, G stands for a positive constant whose precise value is unimportant and which may change 
from line to line. We write —)-p, —?-d, —>-fdd for the convergence in probability and the convergence of (finite- 
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dimensional) distributions respectively, whereas => denotes the weak convergence in the space D[— 1 , 1 ] with 
the supremum metric. 


2 Estimation of random autoregressive coefficient 

Consider an RCAR(l) process 

X{t) = aX{t-l)+C{t), t€Z, (2.1) 

where innovations {C( 0 } admit the following decomposition: 

C{t) = br]{t)t£Z, ( 2 . 2 ) 

where random sequences {?y(t)}, {C(^)} and random coefficients a, b, c satisfy the following conditions: 
Assumption Ai {ri{t)} are independent identically distributed (i.i.d.) random variables (r.v.s) with E 7 y( 0 ) = 
0 , E 7 y^( 0 ) = 1 , E| 77 ( 0 )pP < oo for some p > 1. 

Assumption A2 {'^(t)} are i.i.d. r.v.s with E,^(0) = 0, E,^^(0) = 1, E|^(0)p^ < 00 for the same p as in Ai. 
Assumption A 3 b and c are possibly dependent r.v.s such that Pr( 6 ^ + c^ > 0) = 1 and E 6 ^ < 00 , Ec^ < 00 . 
Assumption A4 a G (—1,1) is a r.v. with a distribution function (d.f.) G{x) := Pr(a < x) supported on 
[— 1 , 1 ] and satisfying 

1 \ ^ dG{x) 

1 — jaj / 1 — jxj 


E 


< 00 . 


(2.3) 


Assumption A 5 a, {r/(t)}, {'^(t)} and the vector {b,c)~^ are mutually independent. 


Remark 2.1 In the context of panel observations (see (3.1) below), {r]{t)} is the common component and 
{^(t)} is the idiosyncratic component of shocks. The innovation process {C(0} ia (2.2) is i.i.d. if the coefficients 
b and c are nonrandom. In the general case {C(^)} is a dependent and uncorrelated stationary process with 
EC(0) = 0, EC2(0) = E62 + Ec 2, EC(0)C(t) = 0, t / 0. 


Under conditions A 1 -A 5 , a unique strictly stationary solution of (2.1) with finite variance exists and is 
written as 


X{t) = Y,a^-X{s), tGZ. 


(2.4) 


S<t 

Clearly, EA(t) = 0 and EA^(t) = E((^(0)E(1 — a^)“^ < 00 . Note that (2.3) is equivalent to 


E 


ir^) 


<00, 1 < p < 2, 


.1 - |a|P. 

since 1 — jaj < 1 — \a\P < 2(1 — jaj) for a £ (— 1 , 1 ). 

For an observed sample A(l),..., A(n) from the stationary process in (2.4), define the sample mean 
Xn := n~^ the sample lag 1 autocorrelation coefficient 


Ztl\X{t)-Xn){X{t + l)-Xn) 
Trt=i{x{t) - XnY 


dn • — 


(2.5) 


Note the estimator an in (2.5) does not exceed 1 a.s. in absolute value by the Cauchy-Schwarz inequality. 


Moreover, it is invariant to shift and scale transformations of {A(t)} in (2.1), i.e., we can replace {A(t)} by 
{pX{t) + /i} with some (unknown) p gU. and p > 0. 
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Proposition 2.1 Under Assumptions Ai~A^, for any 0 < 7 < 1 and n > 1, it holds 

Pr(|an-a|>7) < (7(n-«^’/2)A(p-i))^-p ^ 


with C > 0 independent o/n, 7 . 

Proof. See Appendix. 

Assume now that the d.f. G{x) = Pr(a < x) satisfies the following Holder condition: 
Assumption As There exist constants Lq > 0 and g G (0,1] such that 

\G{x)-G{y)\ < Lclx-yl^, x,yG[-l,l]. 


Consider the d.f. of 

Gn{x) := Pr(o„ < x), x G M. 

Corollary 2.2 Let Assumptions Ai-Aq hold. Then, as n ^ 00 , 

sup |Gn(x) — G{x)\ = 
a;e[—1,1] 


Proof. Denote 6n ■= a-n — a. For any (nonrandom) 7 > 0 from ( 2 . 6 ) we have 

sup \Gn{x) - G{x)\ = sup I Pr(a + < x) - Pr(a < x)| < Lg 7 ^ + Pr(|(5„| > 7 ), 

a:e[—1,1] a:e[—1,1] 


( 2 . 6 ) 

(2.7) 


implying 

sup |G„(x)-G(x)| < 7 . 07 "^ + C(n-i + n-((P/ 2 )A(p-i))^-P) 
xe[—1,1] 

with C > 0 independent of n, 7 . Then the corollary follows from Proposition |2.1| by taking 7 = 7 n = o(l) 
such that 7 n ~ and noting that the exponent ^^(f U {p — 1 )) < 1 . □ 


3 Asymptotics of the empirical distribution function 


Consider random-coefficient AR(1) processes {Ai(t)}, i = 1, 2,..., which are stationary solutions to 

Xi{t) = aiXi(t — 1) + C,i{t), t G Z, 


(3.1) 


with innovations {Ci(i)} having the same structure as in ( 2 . 2 ): 


Ci{t) = hvit) + Ciii{t), t G Z. 


(3.2) 


More precisely, we make the following assumption: 

Assumption B {^(t)} satisfies Ai; {Ci(^)}) {bi-,Ci)^, a*, i = 1,2,..., are independent copies of {C(^)}) 
( 6 , c)^, a, respectively, which satisfy Assumptions A 2 -Ae. (Note that we assume A 5 for any i = 1 , 2 ,....) 


Remark 3.1 The individual processes Xi have covariance long memory if conditions (2.3) and |1 — 
x|“^dG(x) = 00 hold, which is compatible with Assumption B. The same is true about the limit aggregated 
process in (1.2) arising when the common component is absent. On the other hand, in the presence of the 
common component, long memory in the limit aggregated process arises when the individual processes have 
infinite variance and condition (2.3) fails, see 
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Define the sample mean Xi^n ■= n ^ the corresponding sample lag 1 autocorrelation coefficient 

- Xi,n){Xiit + 1) - Xi^n) 


^i,n • — 




l<i< N 


and the empirical d.f. 


1 ^ 

GnA^) '■= ^ < a:), xG 


(3.3) 


(3.4) 


2=1 


Recall that (3.4) is a nonparametric estimate of the d.f. G{x) = Pr(aj < x) from observed panel data 
{Xi{t), t = 1,... ,n, i = 1,..., N}. In the following theorem we show that GnA^) asymptotically 

unbiased estimator of G{x), as n and N both tend to infinity, and prove the weak convergence of the 
corresponding empirical process. 


Theorem 3.1 Assume the panel data model in (3.1 )-i 


sup \EGNAA~Gix)\ = 0{n 
xe[—1,1] 


Let Assumption B hold and N,n ^ oo. Then 

(3.5) 


If, in addition, 


M = ,(^^(fA(p-l)))^ 


(3.6) 


then 

NG\GNAA-Gix)) ^ W{x), xG[-l,l], (3.7) 

where {W{x), x G [—1,1]} is a continuous Gaussian process with zero mean and cov{W{x),W{y)) = G{x A 
y)-G{x)G{y). 


Proof. Note i = are identically distributed, in particular, EGnAA — Gn{x) with Gn{x) 


defined in (2.7). Hence, (3.5) follows immediately from Corollary 2.2 


To prove the second statement of the theorem, we approximate Gj^AA by the empirical d.f. 

1 ^ 


Gn{x) ■■= — ^l(ai<x), xG[-1,1] 


2 = 1 


of i.i.d. r.v.s ai,i = l,...,iV. We have — G{x)) = N^/^(GAr(x) — G{x)) + DjsiAx) with 

DnAA ■— X^^‘^{GnAA ~ Gn{x))- Since Ag guarantees the continuity of G, it holds 


iVi/2(G7v(x)-G(x)) ^ W(x), xG[-l,l] 


by the classical Donsker theorem. Then ( |3.7| ) follows once we prove sup 3 ,g[_i |DAr^„(x)| —>-p 0. By definition, 

N 

DnAA = ^(l(ai + 5i^n <x)- 1 ( 0 * < x)) = D'^AA - D'^AA^ 

2=1 

where 5i^n '■= A,n — CLi, i = 1,..., N, and 

N 

G>'nAA '■= l(x < g* < X - 5 i^n, di,n < 0 ), 

2=1 

N 

G>nAA '■= - 6i^n < aj <x, <5*,** > 0)- 

2=1 
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For 7 > 0 we have 


N 


N 


< N < a, < x + 7 ) + ^ l(|(5i,n| > 7 ) =: Vl^{x) + Vj 


n 

N,n- 


2 = 1 


(Note that does not depend on x.) By Proposition 


2.1 


2=1 

we obtain 


N 


E17,„ = Pr(|(5i,n| > 7 ) < 


2 = 1 


which tends to 0 when 7 is chosen as 7 ^+^ = n dp/ 2 )A(p 1 )) g_ ]\jext, 

VM = N^/^{Gn{x + 7)-Gn{x)) = N^/\G{x + j)-G{x)) + Un{x,x + j], 

UNix,x + ^] := N^^‘^{Gn{x +'y) - G{x + j)) - N^^‘^{Gn{x) - G{x)). 

The above choice of 7 ^+P = implies supj.g[_;^ g + 7 ) — G(a;)| = = o(l), 


whereas Un{x,x + 7 ] vanishes in the uniform metric in probability (see Lemma 7.2 in Appendix). Since 
D'^nix) is analogous to D'^^{x), this proves the theorem. □ 

Remark 3.2 (3.6) implies that n ^ j^{s+p)/ev asymptotically for p > 2. Note that {g + p)/gp > 1 and 
limp^oo(^’+p)/^>P = l/^? for any g G (0,1]. We may conclude that Theorem 


3.1 


as well as other results of this 


paper apply to long panels with n increasing much faster than N, except maybe for the limiting case p = 00 


for g = 1. The main reason for this conclusion is that a* need to be accurately estimated by (3.3) in order 


that GN,n{x) behaves similarly to the empirical d.f. Gn{x) based on unobserved autocorrelation coefficients 
(li,! < i < N. 


4 Goodness-of-fit testing 


Theorem 3.1 can be used for testing goodness-of-fit. In the case of simple hypothesis, we test the null 
Hq : G = Go vs. Hi : G ^ Gq with Go being a certain hypothetical distribution satisfying the Holder condition 


in (2.6). Accordingly, the corresponding Kolmogorov-Smirnov (KS) test rejecting Hq whenever 


sup \GN,nix) - Go(x)| > c(a;) 


(4.1) 


has asymptotic size u G (0,1) provided N, n, Go satisfy the assumptions for (3.7) in Theorem 3.1 (Here, c{uj) 
is the upper w-quantile of the Kolmogorov distribution.) However, the goodness-of-fit test in ( |4.1[ ) requires 
the knowledge of parameters of the model considered, which is not typically a very realistic situation. Below, 
we consider testing composite hypothesis using the Kolmogov-Smirnov statistic with estimated parameters. 
The parameters will be estimated by the method of moments. 

Write p = (/r(^),..., and pN,n = ■■■, where 

p(^) ■= Ea“ = / x“dG(x), := — 


'-1 


.n)“, 


1 < tt < m. 


2=1 


Proposition 4.1 Let the panel data model in (3.1)-(3.2) satisfy Assumption B with exception of Assump¬ 
tion Aq. If N = 


as N,n ^ 00 , then 
N^^‘^{pN,n — p) —A/’(0,i;), where S := (cov(a“, a’')) 


l<22,2;<m* 


(4.2) 
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Proof. Write 


N^^‘^{V‘N,n- P) = N^^‘^{V-N,n-V‘N) + - IJ), 

where fiM '■= jqYM=ii^ii ■ ■ ■ have — p) —>-(1 AA(0, S) as A/' —>■ oo by the multivariate 

central limit theorem. On the other hand, N^^‘^{fiN,n — Pn) — 0 follows from E|a“ — aP\ < CEla,! — a| < 
( 7(7 + Pr(|on — a| > 7 )) and Proposition 2.1 with 7 ^+P = proving the proposition. □ 


Remark 4.1 Robinson |15( Theorem 7] discussed a different estimate of /r, which was proved to be asymptot¬ 
ically normal for fixed n as —?• 00 in contrast to ours. However, his result holds in the case of idiosyncratic 


innovations only and under stronger assumptions on G than in Proposition 4.1 which do not allow for long 
memory. 

Consider testing the composite null hypothesis that G belongs to the family Q = {Gg, 6 = {a,/3)'^ G 
( 1 , 00 )^} of Beta d.f.s versus an alternative G ^ Q, where 

1 


Geix) = 


xG[0,l] 


B{a,/3) Jo 

and B(a!,/3) = r(Q;)r(/ 3 )/r(a + /3) is Beta function. The uth moment of Gg is given by 

„i u—l 

lo 


(4.3) 


= / x“dG,(x)= n 


r=0 


a + r 
a + /3 + r' 


Parameters a,/3 can be found from the first two moments /U = {p^^\ as 


a = 


^{l)(^(l)_^( 2 )) 
^( 2 )_(^( 1))2 ’ 


/? = 


(l-^W)(^(l)-^( 2 )) 
^(2) _ (^(1))2 


(4.4) 


The moment-based estimator '■= {o(N,n, f3N,n)~^ of 0 = {a,P)'^ is obtained by replacing fj, in (4.4) by its 
estimator fiN^n- The consistency and asymptotic normality of this estimator follows by the Delta method 


from Proposition |4.1[ see Corollary 4.2 below, where we need condition a,/3 > 1 to satisfy Assumptions A 4 
and Ag. 


Corollary 4.2 Let the panel data model in (3.1)-(3.2) satisfy Assumption B. Let G = Gg, 9 = {a,fdy be a 


Beta d.f. in (4.3), where a > 1, (3 > 1. Let N,n increase as in (3.6) where g =1. Then 


- 0) AA(0, Ae), Kg := A-iS(A-i)', 

where S is the 2x2 matrix in ( |4.2[ ) and 

fdp.^^'>/da dgG)/df3\ 


(4.5) 


A := dfi/d9 = 


\dp^^^/da 


Moreover, 9]\f,n is asymptotically linear: 

N 


N^^‘^{9N,n- 0) = N G‘^'^lg{ai) + Op{l), lg{x) := A ^(a: - 3.2 _ ^( 2 )^T^ 

i=l 

where 'Elg{a) = lg(x)dGg(x) = 0 and Elg(a)lg(a}~’~ = lg(x}lg(x)~’~dGg(x) = Ag. 


(4.6) 
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Corollary 4.3 Let assumptions of Corollary 4-2 hold. Then 

N^^'^{GN,n{x) - G^^Jx)) => Vg{x), xG[0, 1], 
where {Vg{x), x G [0,1]} is a continuous Gaussian process with zero mean and covariance 
co\{Ve{x),Ve{y)) = Gg{x Ay) - Ge{x)Gg{y) 


- le{u)'dGgiu)deGe{y)- lg{u)'dGe{u)deGe{x) + deGe{x)'AgdeGgiy), 
Jo Jo 


where dgGQ{x) :=dGg{x)/d9= (^dGg{x)/da,dGg{x)/df3^~^, x£ [0,1] and Kg is defined in (4.5) 


Proof. The d.f. Gg with a > 1, fi > 1 satisfies Assumptions A 4 and Ae with g = 1. Recall G]sf{x) := 
Yl^=i < x), X G [0,1]. Since condition (3.6) is satisfied, so A^^/^ sup^-gjo,!] \GN,n{x)— Gn{x)\ vanishes 
in probability by Theorem 3.1, whereas the convergence N^G{Qj^(^x) — G^^ (x)) ^ Ve{x), x G [0,1] follows 
from (4.6) using the fact that dgGg{x), x G [0,1] is continuous in 6, see [7j or [20l Theorem 19.23]. □ 


With Corollary 4.3 in mind, the Kolmogorov-Smirnov test for the composite hypothesis G £ Q can be 
defined as 


sup iV^/^|GAr,n(x) - Gg-^^(x)| > c{uj,eN,n) 


a:e[0,l] 


(4.7) 


where c{u},9) is the upper w-quantile of the distribution of sup 3 ,g[o,i] l^ 6 i(x)|: 

Pr f sup \Vg{x)\ > c{uj,9)'\ = 00 , a;G(0, 1). 

^a:e[ 0 ,l] ^ 


The test in (4.7) has correct asymptotic size for any uj G (0,1), which follows from Corollary |4.3| and the 
continuity of the quantile function c{u},9) in 9, see [T9l p. 69], [ 20 ]. By writing A'^/^(GAr,n(x) — G^^ (x)) = 
N^GfQj^ (x) — G(x)) + N^^^(G{x) — Gz (x)), it follows that the Kolmogorov-Smirnov statistic on the l.h.s. 

_ ’ ^N,n 

of (|^ tends to inhnity (in probability) under any fixed alternative G ^ Q which cannot be approximated 
by a Beta d.f. Gg in the uniform metric, i.e., such that inf^ sup^-gjo,!] \G{x) — Gg{x)\ > 0. Moreover, even 
under the alternative, we preserve the consistency of fiN,n, hence c{LO,9]s[,n) being a continuous function of 


sample moments, converges in probability to some finite limit. Therefore the test (4.7) is consistent. 


In practice, the evaluation of c(uj, 9) requires Monte Carlo approximation which is time-consuming. Alter¬ 
natively, [TSl [T9| discussed parametric bootstrap procedures to produce asymptotically correct critical values. 
We note that the assumptions of m Theorem 1] are valid for the family of Beta d.f.s and the moment-based 
estimator of 9 in Corollary |4.3[ The consistency of the test when using bootstrap critical values follows by a 


similar argument as in (4.7). 


5 Kernel density estimation 

In this section we assume G has a bounded probability density function g{x) = G'(x),x G [—1,1], implying 


Assumption Ag with Holder exponent = 1 in (2.6). It is of interest to estimate g{x) in a nonparametric 


way from ai ,„,... ,aN,n ( |3.3[ ). 

Consider the kernel density estimator 


1 ^ 

?iV.n(x) ■= 


2=1 


X Cli', 


h 


X G 


(5.1) 














where X is a kernel, satisfying Assumption A 7 and h = hN,n is a bandwidth which tends to zero as N and n 
tend to infinity. 

Assumption A 7 K : [—1,1] —>■ M is a continuous function of bounded variation that satisfies K(x)dx = 
1. Set ||A||| := K(x)^dx and := x^K(x)dx and A(x) := 0, x G M \ [—1,1]. 

We consider two cases separately. 


Case (i) Pr(5i = 0) = 1, meaning that the coefficient bt = 0 for the common shock in (3.2) is zero and that 
the individual processes {Aj(t)}, i = 1 , 2 ,..., are independent and satisfy 

Xi{t) = aiXi{t - 1) + Ci^i{t), t E Z. 


Case (ii) Pr(6i 7^ 0) > 0, meaning that {Xj(t)}, i = 1,2,..., are mutually dependent processes. 
Proposition 5. 1 Let Assumptions B and A 7 hold. If —)■ 00 , then 

EgN,n{x) gix) 

at every continuity point x E M of g. Moreover, if 

j7,(p/2)a(p-i)^i+p qq Case (i), 

7j(p/2)A(p-i)|'^y'jY)i+p — 00 in Case (ii), 

then 

j9{xi)\\K\\l if xi=X2, 


(5.2) 


(5.3) 


Nhcov{gN,n{x\),gN,n{x2)) 




0 


if Xi ^ X2 


(5.4) 


at any continuity points X\,X 2 EM of g. If Nh — )• 00 holds in addition to (5.3), then the estimator gN,n{x) 
is consistent at each continuity point x E M.' 


^9N,nix) - g{x)\‘^ 0 . 


(5.5) 


Proof. Throughout the proof, let Kh{x) := K{x/h), x E M. Consider (5.2). Note ^gN,n{x) = h ^EA/i(x — 


On), because di^n, i = 1, ■ ■ ■, N, are identically distributed. Let us approximate gN,n{x) by 

N 




Nh 


x — aj), X E 


(5.6) 


i=l 


which satisfies E^ 7 v(x) = h ^EA/i(x — a) —)• g{x) as /i —)• 0 at a continuity point x of g, see [13]. Integration 
by parts and Corollary |2.2| yield 


(5.7) 


h\EgN^n{.x) - E 5 Ar(x)| = / (G„(y) - G{y))dKh{x - y) 


< V{K) sup |Gn(?/)-G(y)| = C>(n-((P/ 2 )A(p-i))/(i+p)^^ 
y6[-l,l] 


uniformly in x E M, where V{K) denotes the total variation of K and V{K) = V{Kh). This proves (5.2). 
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Next, let us prove (5.4). We have 


NhCOY{gN{xi),gN{x 2 )) = ^ ^Kh{xi - a)Kh{x 2 - a) 


{g{xi)\\K\\l ifxi = X2, 

I 0 if xi 7^ X2, 


(5.8) 


as /i —)■ 0 at any points xi,a ;2 of continuity of g, see m- Split Nh{cOY{gN,n{xi),gN,ni^2)) - 
cov{gN{xi),gN{x 2 ))} = J2i=i Qiixi,X 2 ), where 


Ql{xi,X 2 ) 

Q2{xi,X2) 

Q3(xi,X2) 


:= h~^{EKh{xi - an)Kh{x2 - an) - EKh{xi - a)Kh{x2 - a)}, 

:= h~^{EKh{xi - an)EKh{x2 - an) - EKhixi - a)EKh{x2 - a)}, 
:= {N - l)h~^ coY{Kh{xi - ai^n),Kh{x2 - 02 ,n))- 


Note Q 3 {xi,X 2 ) = 0 in Case (i). Similarly to (5.7), 

\Qi{xi,X 2 )\ = h~^ [ {Gn{y) - G{y))dKh{xi - y)Kh{x 2 - y) 




since V{Kh{xi — ■)Kh{x 2 — ■)) < G and \Q 2 {xi,X 2 )\ < Gh dvG)/^{v i))/(i+p) g uniformly in X\,X 2 - 
Finally, 


\Q3{xi,X2)\ 
N - 1 


< 


h 

GN 

~h~ 


(Pr(ai,n < 2/1,02,n < 2/2) - Pr(ai,n < 2/1) Pr(a2,n < y2))dKh{xi - yi)dKhix2 - 2/2) 


sup I Pr(ai,n < 2/1,02,n < 2/2) - Pr(ai,n < 2/1) Pr(a2,n < 2/2)! 

yi ,?;2 6 [-i,i] 


= 0(Afh-in-((P/2)A(p-i))/(i+p)) ^ 


proving (5.4) and the proposition. 


□ 


Remark 5.1 It follows from the proof of the above proposition that in the case of a (uniformly) continuous 


density g{x),x G [—1,1], relations (5.2), (5.5) and the first relation in (5.4) hold uniformly in x G M, implying 
the convergence of the mean integrated squared error: 

/ OO 

^gN,ni^) - g{x)\'^ dx 0. 

-00 

Proposition 5.2 (Asymptotic normality) Let Assumptions B and Ay hold and assume Nh 00 in addition 


to (5.3). Moreover, let K be a Lipschitz function in Case (ii). Then 

gN,n{x) - EgN^nix) 


^/vaic{gN,nix)) 

at every continuity point x G (—1,1) of g. 


—^-d AA(0,1) 


(5.9) 


Proof. First, consider Case (i). Since gN,n{x) = (Nh) is a (normalized) sum of i.i.d. r.v.s 

Vi^N ■ = Kh{x — tti^n) with common distribution Vn '■= Fl.aT) h suffices to verify Lyapunov’s condition 


E\Vn-EVn\^+^ 

NG2 {var(V)v)}( 2 +'^P 2 


0 , 


(5.10) 
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for some 5 > 0. This follows by the same arguments as in m- Analogously to Proposition |5.1[ we 


have ElVjvP"'''^ = — ~ hg{x) \K{y)\‘^~^^di.y = 0{h) while var(VAr) = Nh? Ya,Y(gN^n{x)) ~ 

hg{x)\\K \\2 according to ( |5.4[ ). Hence the l.h.s. of ( |5.10[ ) is 0{{Nh)~^/‘^) = o(l), proving (5.9) in Case (i). 

Let us turn to Case (ii). It suffices to prove that \/ Nh{gN,nix) — gN{x)) —)-p 0, for gNix) given in (5.6). 
By \K{x) - K{y)\ < Lk\x - y\, G M, for e > 0 


Pr 


[■ (yNh\gN,nix) - ?Ar(x)| 


>e < 






V\/M ^ h 

2 = 1 

< A'Pr f |a„ — a| > 

< C 


>e 


NJ LkJ 

+ = 0 ( 1 ) 
\h J n J 


from Proposition 2.1 and (5.3) with Nh —)• oo. 


Corollary 5.3 Let assumptions of Proposition 5.2 hold with h ~ cN for some c > 0, i.e., 

Case (i), 

5^lP, 


N = 


^))) Case (ii). 

Moreover, let g G C'^[—1,1] and yK{y)dy = 0. Then 

N‘^^^{9N,n{x) - g{x)) -^d M{ji{x), (T^(x)), 


where g.{x) := {cP/2)g”{x)g. 2 {K) and a^(x) := (l/c) 5 f(x)||iL| 


2 - 


□ 


Proof. This follows from Proposition 5.2, by noting that Eg]Y(x) — g(x) rsj hpg"{x) pi 2 {K) / 2 as h —?■ 0 and 
E?7v,n(x) - Eg^ix) = 0(/i-in-((P/2)A(p-i))/(i+p)) (j5^_ □ 


6 Simulations 


In this section we compare our nonparametric goodness-of-fit test in (4.1) for testing the null hypothesis 


G = Go with its parametric analogue studied in jT]. In accordance with the last paper, we assume {Xi(t)} in 


(3.1) to be independent AR(1) processes with standard normal i.i.d. innovations {Ci(0}) C(0) ~ .^(0) 1) and 
the random autoregressive coefficient Oj G (0,1) having a Beta-type density g{x) with unknown parameters 
0:=(a,/3)T: 


gix) = 


„2«-l 


(1 — x^)^ xG(0, 1), a>l,P>l. 


( 6 . 1 ) 


B(«,/3) 

Note that /3 G (1,2) implies the long memory property in {Aj(t)}. Beran et al. [T] discuss a maximum 
likelihood estimate 9N,n,K, = (S, oi 6 = {a, fd)'^ when each unobservable coefficient a* is replaced by its 


estimate di^n,K '■= min{max{aj^„, k], 1 — k} with cn^n given in (3.3) and 0 < k = re) —)• 0 is a truncation 
parameter. Under certain conditions on iV, re —>■ oo and k —)• 0, Beran et al. [U Theorem 2] showed that 


N^/\eN,n,n-eo) AA(O,A-l( 0 o)), 


( 6 . 2 ) 
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where 6q is the true parameter vector, 


m ■■= 


'0i(a) - ^^1(0: +/?) -V'i(a + /3) 
-V^i(a + /3) V'i(/3) - V’i(a + /5), 


and 'ij^i{x) := d^lnr(x)/dx^ is the Trigamma function. Based on (4.1) and (6.2), we consider testing both 
ways (nonparametrically and parametrically) the hypothesis that the unobserved autoregressive coefficients 


oi,... ,a 7 v are drawn from the reference distribution Go having density function in (6.1) with a specific 
i.e., the null G = Gq vs. the alternative G 7^ Go- The respective test statistics are 


Ti := iV^/2 sup \GnM - Go(x)| and Ts := A^(0iv,n,K - Oof A{eo){eN,n,n - Oo). (6.3) 

X 


Under the null hypothesis, the distributions of statistics Ti and T 2 converge to the Kolmogorov distribution 
and the chi-square distribution with 2 degrees of freedom, respectively, see (4.1), (6.2). 

To compare the performance of the above testing procedures, we compute the empirical distribution of 
the p-value of Ti and T 2 under null and alternative hypotheses. The p-value of observed Ti is defined as 


p{Ti) = 1 — )Ci{Ti),i = 1,2, where )Ci{y),i = 1,2 denote the limit distribution functions of (6.3). Recall that 
when the significance level of the test is correct, the (asymptotic) distribution of the p-value is uniform on 
[0,1]. The simulation procedure to compare the performance of Ti and T 2 is the following: 


Step So We fix the parameter under the null hypothesis Hq : 9 = Oq with Oq = (2, lAf. 

Step Si We simulate 5000 panels with N = 250, n = 817 for five chosen values 9 = (2,1.2)"'", (2,1.3)"'", 
(2,1.4)"'', (2,1.5)"'', (2,1.6)"'' of Beta parameters. 


Step S 2 For each simulated panel we compute the p-value of statistics Ti and T 2 . 

Step S 3 The empirical c.d.f.’s of computed p-values of statistics Ti and T 2 are plotted. 

The values of Beta parameters Oq = (2, l-d)"*^, iV, n were chosen in accordance with the simulation study 

in p. 

Fig. 1 presents the simulation results under the true hypothesis 9 = Oq with zoom-in on small p-values. We 
see that both c.d.f.’s in the left graph are approximately linear. Somewhat surprisingly, it appears that the 
empirical size of Ti (the nonparametric test) is better than the size of T 2 (the parametric test). Particularly, 
for significance levels 0.05 and 0.1 we provide the empirical size values in Table 1. 

Fig. 2 gives the graphs of the empirical c.d.f.’s of p-values of Ti and T 2 for several alternatives 9 ^ Oq. It 
appears that for /3 > /3o = 1-4: the parametric test T 2 is more powerful than the nonparametric test Ti but for 
P < Po the power differences are less significant. Table 1 illustrates the empirical power for the significance 
levels 0.05, 0.1. 


Signif. level 

5% 

10 % 

P 

1.2 

1.3 

1.4 

1.5 

1.6 

1.2 

1.3 

1.4 

1.5 

1.6 

Ti 

.532 

.137 

.049 

.208 

.576 

.653 

.223 

.103 

.315 

.702 

T 2 

.500 

.104 

.077 

.313 

.735 

.634 

.184 

.134 

.421 

.827 


Table 1: Numerical results of the comparison for testing procedure Ho'- 9 = (2,1.4)^ at significance level 
5% and 10% . The column for /3 = 1.4 provides the empirical size. 
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p-value 



Figure 1: [left] Empirical c.d.f. of p-values of Ti and T 2 under Hq : Oq = (2,1.4)"'^; 5000 replications with 
N = 250, n = 817. [right] Zoom-in on the region of interest: p-values smaller than 0.1. 


The above simulations (Fig. 1 and 2, Table 1) refer to the case of independent individual processes {Xj(t)}. 
There are no theoretical results for the parametric test T 2 , when AR(1) series are dependent. Although the 
nonparametric test Ti is valid for the latter case, one may expect that the presence of the common shock 
component in the panel data in (3.2) has a negative effect on the test performance for short series. To 


illustrate this effect, we simulate 5000 panels with AR(1) processes {Aj(t)} driven by dependent shocks in 
(3.2) with bi = b, Ci = {1 — As previously, we choose Oq = (2,1.4)"'", N = 250, n = 817 and we fix 

9 = (2,1.4)"'" to evaluate the empirical size of Ti. Fig. 3 [left] presents the graphs of the empirical c.d.f.’s of 
the p-values of Ti for 6 = 1 , 6 = 0.6 and 6 = 0 , the latter corresponding to independent individual processes 

as in Fig. 1. We see that the size of the test worsens when 6 increases, particularly when 6 = 1 and the 

individual processes are all driven by the same common noise. To overcome the last effect, the sample length 
n of each series in the panel may be increased as in Fig. 3 [right], where the choice of n = 5500 and 6 = 1 
shows a much better performance of Ti under the null hypothesis 9 = 9q = (2,1.4)"'^ and the alternative 
[9 = ( 2 ,1.5)"'^ and 9 = (2,1.6)"'') scenarios. 

In conclusion, 

1. We do not observe an important loss of the power for the nonparametric KS test Ti compared to the 
parametric approach. 

2. The KS test Ti does not require to choose any tuning parameter contrary to the test T 2 . 

3. One can use the KS test Ti under weaker assumptions on AR(1) innovations. We only impose moment 


conditions. The dependence between the series is allowed by (3.2). 
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p = 1.2 


P = 1.3 




p = 1.5 p = 1.6 




Figure 2: Empirical c.d.f. of p-values of Ti and T 2 for testing Hq : 9q = (2,1.4)"'^ under several alternatives 
of the form 9 = {2,/3)'^] 5000 replications with N = 250, n = 817. 


15 











p-value 



Figure 3: [left] Empirical c.d.f. of p-values of Ti under Hq: 9o = (2,1.4)"'" for different dependence structure 
between AR(1) series : bi = b and Ci = ^/l — b'^ and N = 250, n = 817. [right] Empirical c.d.f. of p-values 
of Ti for testing Hq : 6q = (2, lA)^. AR(1) series are driven by common innovations, i.e., bi = 1, Cj = 0, for 
9 = (2,/3)'''; 5000 replications with N = 250, n = 5500. 
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7 Appendix: some proofs and auxiliary lemmas 

We use the following martingale moment inequality. 

Lemma 7.1 Letp > 1 and > 1} be a martingale difference sequence: E[^j|^i,..., ^j-i] = 0, j = 2, 3,... 
with < oo. Then there exists a constant Cp < oo depending only on p and such that 


e|E«. 

1=1 


< Cp 


i<p< 2 , 

P> 2 . 


(7.1) 


For 1 < p < 2, inequality (7.1) is known as von Bahr and Esseen inequality, see |21) . and for j? > 2, it is a 
consequence of the Burkholder and Rosenthal inequality mm. see also [SI Lemma 2.5.2]). 

Proof of Proposition 


2.1 


Since dn in (2.5) is invariant w.r.t. a scale factor of innovations {C(^)}) w-l.g. 
we can assume 6 ^ + = 1 and EC^(O) = 1, E|C(0)pi’ < oo. Then dn — a = where 


^ni ■— — 


aX^jn) ^ E:=lxmt + 1) 

EtiXHt)-niXnr ■ E7=iXHt)-n{Xnr 


Sn3 ■ — 


Xn{X{l) + X{n)) - {Xnf{l + n(l - o)) 


Trt=iX^{t)-n{Xn? 

The statement of the proposition follows from 

Pr(|<5™| > 7 ) < C(n-i + n-^PC)Cp-^)^-P) (0 < 7 < 1, i = 1, 2, 3). 


(7.2) 


To show ( |7.2| ) for i = 1, note that 5ni = Ln/{n + Dn), where := —a(l — a^)W^(n) and Dn = Dni — Dn 2 , 
C>ni := Z)r=i((l-a^)-’^^(i)-l), Dn 2 := n(l-a^)(Xn)^- We have Pr(|(5„i| > 7 ) < Pr(|D„| > n/2) + Pr(|L„| > 
n^jT). Thus, (7.2) for i = 1 follows from 

E\DniK^ < Cn, E\Dn 2 \ < C and E\Lnf < C. (7.3) 


Consider the first relation in (7.3). Clearly, it suffices to prove it for 1 < p < 2 only. We have Dni = 
‘^D'ni + Ki. where 

n 

d;, := {l-o") ^ 

S2<si<n i=lVsi 


D: 


nl 


■= E 


x2h-*)((2(5) - 1). 


s<nt=lVs 
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We will use the following elementary inequality: for any —1 < a < l,n> l,s<n 

a^(i-^)(l _ s < 0, 


an{s) := (l-a^) Yj 


, 2 (t-s) _ 


t=lVs 


< c 


1 — 1 < s < n 

a- 2 s 2n(l — |a|)), s < 0, 


1, 1 < s < n. 

Using the independence of {C(s)} and a and inequality (twice) for 1 < p < 2 we obtain 

E\D'^,\P = E| a„(si)C(si) a*^-*^C(52)r 

si<n S2<si 


(7.4) 


< 

ceE 

|«n(si)C(si) Yj 

-* 2 ^, 


s\<n 

S 2 <Sl 


< 

ceE 

|a„(si)r \a\P^^^- 

-S 2 ) 


s\<n 

S2<Sl 


< 

CE(1 - 


Cn 


s<n 


since E(1 — |o|) ^ < oo (see (2.3)) and 1 0 :^( 5 )< Cn follows from (7.4). Similarly, since {w(s) — 1, s < 


n} form a martingale difference sequence, 

E|Z)"i|P < CE^|«„(s)|P < Cn, 


s<n 


proving the first inequality (7.3). The second inequality in (7.3) follows by noting that nX^ = 
Es<n(EILivsa*"")C(s) and 


(1 - a^)E[{nXnf\a] = 


1 -a^\2 


1 — a 




S = 1 


1 — a®\ 2 Cn 


1 — a J 1 — a 


Consider the last inequality in (17.3|) . We have \Ln\ < 1 2L'^ + L" + 11, where 


L; := (l-a^) a2("-^^)a^-^^C(si)C(^2), K := (1 - a^) J] a2("-^)(C'(s) - 1). 

S2<si<n 


s<n 


We use Lemma 


7.1 


i above. Let 1 < p < 2. Then E\L'^\p < ^^,<^{(1 - < C and 

E|L;|p < - a 2 )|a| 2 (^-"i)|a|*i-" 2 }P < CE(1 - |a|)P-2 < C. Next, let p>2. Then E|L"|p < 

<^E{E.<n 1(1 - a^)a2(^-")|2}P/2 < C and E|L;|p < CE(1 - < C, proving 

(7.3) and hence for i = 1 . 


Consider (7.2) for i = 2. We have 6 n 2 = Rnjin + Dn), where Rn ■= (1 — a^) Yl^=i ^(^)C(^ + 1) and Dn is 


the same as in (7.3). Then Pr(| 5 „ 2 | > 7 ) < Pr(|7?n| > ^7/2) + Pr(|i7„| > n/2), where 

Pr(|Zl„| > n/2) < {n/4YP^^^E\Dnir^ + {n/ 4 r^E\Dn 2 \ 

n-(p-i), l<p< 2 , 


< C 


n P > 2 , 


(7.5) 


according to (7.3). Therefore (7.2) for i = 2 follows from 

n, 1 < p < 2 , 


E\Rn\P < C 


jivC^ p > 2. 
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(7.6) 





















Since i?„ = (1 - o^) J2s<n-i C(s) EHws * C(t + 1) is a sum of martingale differences, by inequality ( |7.1[ ) 
with 1 < p < 2 we obtain 


< 

CE 1(1-a' 

n—1 

’)c(») E 

Xit + i)\^ 


s<n- 

-1 

t=lVs 


< 

GE\1 - 

E 





s<n— 

1 4=lVs 


< 

GE\1 - 

-ni: 

n—1 n—in—i 

iar>’*Eior + EEi“i’’“^‘’) 



s<0 

t=i 

S = 1 t=s 

< 

GE\1 - 


-\a\P)-^ + n{l 

— aP’)“^} < Gn, 


proving (7.6) for p <2. Similarly, using (7.1) with p > 2 we get 

n—1 


E\R„f = E^ll-a^l^’EO C(5) a^-X{t+ 1 )\p\g 

s<n—l t=l\/s 

n— 1 

il-aY{ (E[|C(s) a‘-C(t + l)r|a])"/q>’''" 


< CE 


s<n—1 


£=1VS 


n-1 


< CE|l-a2|P{ ^ ^ 

s<n—l t=l\/s 

n—1 n—1n—1 

< CE\l-a‘^\P 

S = 1 t=s 


s<0 


t=l 


< 


CE\l-a'^\P{{l-a^)-^+ n{l-a'^)-^Y^^ < CnP/^, 


proving (7.6) and (7.2) for i = 2. 


It remains to prove (7.2) for i = 3. Similarly as above, Pr(| 5 „ 3 | > 7 ) < Pr(|Q„| > n 7 / 2 ) + Pr(|Z)„| > n/2), 
where Qn ■= (1 — a‘^){Xn{X{l) + X{n)) — {Xnf‘{l + n(l — a))} and Dn is evaluated in (7.5). Thus, 
i = 3 follows from (7.5) and 

E\Qn\P < C{E\{l-a^)X\n)\P + E\{l-a^){Xnf\P + nPE\{l-a){l-a^){Xnf\P] < C. 


for 


(7.7) 


Since " )C(s), an application of the second inequality of (|7.1[) yields 


E[\nXnY\a] < C 


(1 - a^f 


(l-a 2 )(l-a )2 


+ 


E(t 


1 — a^\2\p 


S=1 


— a 


Using 1 — a"" < 1 A (n(l — a)) we obtain E|(l — a)(l — a^){Xnf‘\P < Cn~P and E|(l — a^)(X„)2|P < C'E(1 — 
a)~^n~^. Finally, E|(l — a^)X2(n)|^ < C follows by the same arguments as E|L„|p < C (see (7.3)). This 
proves (7.7), thereby completing the proof of ( |7.2[ ) and of the proposition, too. □ 

Let a,ai,... ,aN be i.i.d. r.v.s with d.f. G{x) = Pr(o < x) supported by [—1,1]. Define Gn{x) := 
— ®)) ^Nix) := N^^‘^{Gn{x) — G{x)), X G M, and coNi^) (= the modulus of continuity of 

Un) by 

u;n{S) ■= sup |UAr(y) - f7Ar(x)|, 5 > 0. 

0<y—x<5 
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Lemma 7.2 Assume that G satisfies Assumption A^. Then for all e > 0, 

e^Pr(w 7 v( 5 ) > 6 e) < {3 + 3C)Lg5^ + N-^ 


where C is a constant independent of e, 6 , N. 

Proof. As in [21 p. 106, (13.17)] we have that 

E\UN{y)-U n{x)\^\Un{z)-UN{ y)f < 3PT{a e {x,y])Fv{a e {y, z]), 

E\UN{y) - Un{x)\^ < 3Pi{aG{x,y] f + N-^Pv{a£{x,y]) 

for —l<x<y<z<l, where the second inequality treats the 4th central moment of a binomial variable. 
Now fix (5 > 0 and split [—1,1] = UjAj, where Aj = [—1 + id, —1 + (i + 1)(5], i = 0,1,..., [2/<5J — 1, 
A|^ 2 /< 5 J = [^1 + !]• According to [T71 p. 49, Lemma 1], for all e > 0, 

Pr(a;Ar((5) > 6 e) < (3 + 3(7) maxPr(o G Aj) + 

i 

where (7 is a constant independent of e, 6 , N. Lemma follows from Assumption Ag on the d.f. G of the r.v. 
a. □ 

Note that if we take 6 = 6 n = o(l), we then get Pr(ti;Ar((5) > e) —)■ 0 as A —)■ oo. 


Lemma 7.3 Let di^n,^ 2 ,n bs- given in (3.3) under Assumptions Ai-Aq with g = 1. Then for all 7 G (0,1) 
and n> 1, it holds 

sup I Pr(ai,„ < X, 02 ,n < y) - Pr(ai,n < x) Pr(a 2 ,n < y)| = 

Proof. Define 5 i,n '■= a , i,n — Oi , i = 1, 2. For 7 G (0,1), we have 

Pr(|5i,n| > 7or |(52,n| > 7 ) < Pr(|(5i,„| > 7 ) + Pr(|(52,n| > 7 ) < + n“^) 


by Proposition 2.1 Consider now 

Pr(ai,n < X, 02,n <y) = Pr(ai + 5 i,n <x, 02 + S2,n < y) 

< Pr(ai + 61 ,n <X ,02 + d 2 ,n < y, < 7> \b 2 ,n\ < l) + Pr(|<5i,„| > 7 or |52,n| > 7 )- 

Then 

Pr(ai + 5i,n <X,02 + <52,n < y, |(5l,n| < 7 , l'52,n| < l) < Pr(ai < X + 7 , 02 < y + 7 , |5l,n| < 7 ) l<^2,n| < 7) 

< G(x + 7)G(?/ + 7) 

and 

Pr(ai + 6l,n <X, 02 + 62,11 < y, |( 5 l,n| < 1, \62,n\ <7) > Pr(ai < x - 7, 02 < y - 7, |( 5 l,n| < 7 > l' 52 ,n| < 7) 

> G{x - 'y)G{y - 7) - Pr(| 5 i,n| > 7or 1^2,n| > 7)- 


From (2.6) we obtain 

|G(x± 7 )G(y± 7 )-G(x)G(y)| = |(G(x) + 0 ( 7 ))(G( 2 /) + 0 ( 7 )) - G(x)G( 2 /)| < C 7 . 
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Hence 


Pr(ai < X, 02 < y) — G'(x)G(y)| < C{'^ + n ^+n p). 


(7.8) 


In a similar way, 


Pr(ai < x) Pr(a 2 < y) - G{x)G{y)\ < Gij + 


(7.9) 


By (7.8), (7.9), the proof of the lemma is complete with 7 = 7n = o(l)) which satisfies jn ^ n ^^7n^- 

□ 
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